A semiparametric approach for marker gene selection based on gene expression data

Authors

  • Zhong Guan
  • Hongyu Zhao
Abstract

MOTIVATION Identification of differentially expressed genes is a major issue in gene expression data analysis, and selection of marker genes is critical in tumor classification using gene expression data. In this paper, we propose a semiparametric two-sample test both to identify differentially expressed genes and to select marker genes for sample classification. RESULTS A simulation study shows that the proposed method is more robust and powerful than commonly used methods, such as t-tests and non-parametric rank-sum tests, when the sample size is small. Cross-validation shows that sample classification based on genes selected using this semiparametric method has lower misclassification rates. CONTACT [email protected].
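The abstract names t-tests and rank-sum tests as the commonly used baselines for per-gene two-sample comparison. The sketch below illustrates only those two baselines, not the semiparametric test proposed in the paper, whose details are not given here. The function names (`welch_t`, `rank_sum_z`, `rank_genes`) and the simulated data are hypothetical and chosen for illustration; the t-test is assumed to be the Welch (unequal-variance) form, and the rank-sum statistic uses the normal approximation without a tie correction in the variance.

```python
import math
import random
from statistics import mean, variance


def welch_t(x, y):
    """Welch two-sample t statistic (assumes nonzero sample variances)."""
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    return (mean(x) - mean(y)) / se


def rank_sum_z(x, y):
    """Wilcoxon rank-sum statistic of x, standardized by the normal
    approximation. Ties receive midranks; the variance is not tie-corrected."""
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    n1, n2 = len(x), len(y)
    w = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2  # average of ranks i+1 .. j
        w += midrank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    mu = n1 * (n1 + n2 + 1) / 2
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (w - mu) / sigma


def rank_genes(group_a, group_b, stat):
    """Return gene indices ordered by decreasing |stat|.

    group_a[g] and group_b[g] hold the expression values of gene g
    in the two sample classes."""
    scores = [abs(stat(a, b)) for a, b in zip(group_a, group_b)]
    return sorted(range(len(scores)), key=lambda g: scores[g], reverse=True)


if __name__ == "__main__":
    # Hypothetical simulation: 50 genes, 5 samples per class,
    # the first 5 genes shifted by 3 units in class A.
    rng = random.Random(0)
    n_genes, n_samples = 50, 5
    class_a = [[rng.gauss(3.0 if g < 5 else 0.0, 1.0) for _ in range(n_samples)]
               for g in range(n_genes)]
    class_b = [[rng.gauss(0.0, 1.0) for _ in range(n_samples)]
               for g in range(n_genes)]
    print("top genes by |t|:", rank_genes(class_a, class_b, welch_t)[:5])
    print("top genes by |z|:", rank_genes(class_a, class_b, rank_sum_z)[:5])
```

With only a handful of samples per class, the rank-sum statistic can take few distinct values, so many genes tie in the ranking; this is one reason small-sample behavior matters when choosing a selection criterion.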

Related articles

Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data

Background and purpose: With advances in science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimensional problems, classical methods are not reliable because of a large number of predictor variable...

Construction of a Mammalian IRES-based Expression Vector to Amplify a Bispecific Antibody; Blinatumomab

Blinatumomab, the bispecific T cell engager, has proven to be the most successful BsAb to date. Throughout the past decade, vector design has been of great importance for the expression of monoclonal antibodies in Chinese hamster ovary (CHO) cells. It has been indicated that expression plasmids based on the elongation factor-1 alpha (EF-1 alpha) gene and DHFR selection marker can be highly effect...

SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

In this paper, we propose a new gene selection algorithm based on the Shuffled Frog Leaping Algorithm, called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most biological datasets, such as cancer datasets, have a large number of genes and few samples. However, most of these genes are not usable in some tasks, for example in cancer classification....

Gene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method

Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...

Journal title:
  • Bioinformatics

Volume 21, Issue 4

Pages: -

Publication date: 2005